This repository has been archived by the owner on Feb 8, 2023. It is now read-only.

feature(Types): added Experiment type encompassing Pipelines, metrics, project_name, etc. #114

Merged
merged 5 commits into from
Jul 31, 2022

Conversation

almostintuitive
Contributor

@almostintuitive almostintuitive commented Jul 30, 2022

Closes #115
Closes #109

@almostintuitive almostintuitive changed the title feature(Types): added Experiment type encompassing Pipelines, metrics… feature(Types): added Experiment type encompassing Pipelines, metrics, project_name, etc. Jul 30, 2022
```python
save_remote: Optional[
    bool
] = None,  # If True, all models will try to upload (if configured); if False, uploading is disabled for all models (even if configured)
remote_logging: Optional[
```

Contributor Author

It looks like remote_logging was not used anywhere! Did we have it wired up on the latest main?

Contributor

It is used in one place: in run.py!

```python
logger_plugins = (
    [
        WandbPlugin(
            WandbConfig(
                project_id=project_id,
                run_name=config.run_name + "-" + pipeline.id,
                train=True,
            ),
            dict(
                run_config=config.get_configs(),
                preprocess_config=preprocess_config.get_configs(),
                pipeline_configs=pipeline.get_configs(),
            ),
        )
    ]
    if config.remote_logging
    else []
)
```

Contributor Author

Sorry, my mistake! I meant save_remote, not remote_logging!
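
A minimal sketch of how the save_remote override described at the top of this thread could resolve against a model's own upload setting, under one reading of that comment; the helper and argument names are hypothetical, not from this PR:

```python
from typing import Optional


def resolve_save_remote(model_configured_to_upload: bool, save_remote: Optional[bool]) -> bool:
    # Hypothetical helper: False disables uploading for every model, even if configured;
    # True and None defer to the model's own upload configuration.
    return model_configured_to_upload and save_remote is not False
```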

```python
self.plugins = obligatory_plugins + plugins

self.pipeline = overwrite_model_configs(self.config, self.pipeline)
self.pipeline = experiment.pipeline  # maybe we want to deepcopy it first?
```

Contributor Author

Question: should we deepcopy the pipeline before we modify it?

Contributor Author

I think you may have already asked this question.

Contributor

It can be deepcopied, although it won't make a difference in practice, because we run the function with each run as well.
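
If the deepcopy were added, it might look like the two lines below, mirroring the diff in this thread; `overwrite_model_configs` and the attributes come from that diff, the deepcopy is only the proposed change, not what was merged:

```python
from copy import deepcopy

# Copy the Experiment's pipeline before mutating it, so the shared
# Experiment object is left untouched between runs.
self.pipeline = deepcopy(experiment.pipeline)
self.pipeline = overwrite_model_configs(self.config, self.pipeline)
```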

```python
run_name: str  # Gets added as a prefix before the pipeline name
train: bool  # Whether the run should do training
dataset: pd.DataFrame
pipeline: "Pipeline"
```

Contributor Author

I needed to use a forward declaration here, otherwise we're in for some dependency-cycle fun!
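
A sketch of the forward-reference pattern being referred to; the module path and the use of TYPE_CHECKING are assumptions here, only the `pipeline: "Pipeline"` annotation comes from the diff:

```python
from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Imported for type checkers only, so this module and the pipeline
    # module never import each other at runtime, avoiding the cycle.
    from .pipeline import Pipeline


class Experiment:
    pipeline: "Pipeline"  # string annotation, resolved lazily by type checkers
```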

```python
run_name: str  # Gets added as a prefix before the pipeline name
train: bool  # Whether the run should do training
dataset: pd.DataFrame
pipeline: "Pipeline"
metrics: Evaluators
```

Contributor Author

metrics looks relatively useful here, as it may change based on the experiment?

```python
run_name: str  # Gets added as a prefix before the pipeline name
train: bool  # Whether the run should do training
dataset: pd.DataFrame
pipeline: "Pipeline"
metrics: Evaluators
preprocessing_config: PreprocessConfig
```

Contributor Author

So actually there could be multiple preprocessing_configs if we merge the data. This is in theory incorrect; in practice, it's probably fine to pass in the one we want to log.

Contributor

You mean in case there are multiple initial data sources?

Contributor Author

Right now, multiple ones that we merge together.
But later there'll be multiple data sources; I completely forgot about that as well!
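
If the field were later widened to cover multiple merged data sources, one option (not part of this PR) would be a list of configs; only the relevant field is sketched here, and whether Experiment is a dataclass is an assumption:

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Experiment:
    # Hypothetical future shape: one PreprocessConfig per data source,
    # with the current single-source case being a list of one.
    preprocessing_configs: List[PreprocessConfig]
```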

@almostintuitive
Contributor Author

The downside of this is that if I only want to change the pipeline we're running, but keep the rest intact, I'll need to do it in a loop or list comprehension to create the different Experiments.
But I think that's fine. wdyt?
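
That loop could look something like the sketch below, assuming Experiment is a dataclass; `base_experiment` and `candidate_pipelines` are placeholder names, not from the PR:

```python
from dataclasses import replace

# Vary only the pipeline; every other Experiment field is carried over unchanged.
experiments = [
    replace(base_experiment, pipeline=pipeline)
    for pipeline in candidate_pipelines
]
```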

@almostintuitive almostintuitive marked this pull request as ready for review July 30, 2022 10:40
Contributor

@szemyd szemyd left a comment


👍 Looks good

@almostintuitive almostintuitive merged commit 76e04df into main Jul 31, 2022
@almostintuitive almostintuitive deleted the feature/experiments branch July 31, 2022 17:37